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One of the biggest challenges in biology is to understand how activity at the cellular level of 
neurons, as a result of their mutual interactions, leads to the observed behavior of an organism 
responding to a variety of environmental stimuli. Investigating the intermediate or mesoscopic level 
of organization in the nervous system is a vital step towards understanding how the integration 
of micro-level dynamics results in macro-level functioning. The coordination of many different co- 
occurring processes at this level underlies the command and control of overall network activity. In 
this paper, we have considered the somatic nervous system of the nematode Caenorhabditis elegans, 
for which the entire neuronal connectivity diagram is known. We focus on the organization of the 
system into modules, i.e., neuronal groups having relatively higher connection density compared to 
that of the overall network. We show that this mesoscopic feature cannot be explained exclusively 
in terms of considerations such as, optimizing for resource constraints (viz., total wiring cost) and 
communication efficiency (i.e., network path length). Even including information about the genetic 
relatedness of the cells cannot account for the observed modular structure. Comparison with other 
complex networks designed for efficient transport (of signals or resources) implies that neuronal 
networks form a distinct class. This suggests that the principal function of the network, viz., 
processing of sensory information resulting in appropriate motor response, may be playing a vital 
role in determining the connection topology. Using modular spectral analysis we make explicit the 
intimate relation between function and structure in the nervous system. This is further brought out 
by identifying functionally critical neurons purely on the basis of patterns of intra- and inter-modular 
connections. Our study reveals how the design of the nervous system reflects several constraints, 
including its key functional role as a processor of information. 



I. AUTHOR SUMMARY 

Why are brains wired in the manner that is observed in 
nature? To address this, we analyze the structure of the 
Caenorhabditis elegans neuronal network at an interme- 
diate level of its organization into groups of densely con- 
nected neurons. These structural units, called modules, 
are composed of multiple neuronal types which rules out 
the possibility that this is a straightforward partition of 
the network into sensory and motor regions. Nor do they 
have a simple anatomical interpretation (e.g., in terms of 
ganglia). We investigate the possibility that this mod- 
ular structure developed through an evolutionary drive 
to optimize a nervous system subject to different con- 
straints. We show that neither wiring cost minimization 
nor maximization of communication efficiency are ade- 
quate in explaining the observed patterns in the network 
topology. The distinction with other complex networks 
designed for efficient signal propagation (such as the In- 
ternet) suggests that the neuronal network's key func- 
tional requirement of information processing may play a 
crucial role in determining its overall structure. Analyz- 
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ing the network organization at the mesoscopic level also 
allows us to identify neurons belonging to known func- 
tional circuits and predict those that may play a vital 
part in some as yet undetermined function. 

II. INTRODUCTION 

The relatively simple nervous systems of invertebrate 
organisms provide vital insights into how nerve cells inte- 
grate sensory information from the environment, result- 
ing in a coordinated response. Analysing the intermedi- 
ate or mesoscopic level of organization in such systems 
is a crucial step in understanding how micro-level ac- 
tivity of single neurons and their interactions eventually 
result in macro-level behavior of the organism The 
nematode Caenorhabditis elegans is a model organism on 
which such an analysis can be performed, as its entire 
neuronal wiring layout has been completely mapped 0] ■ 
This information enables one to trace in full the course of 
activity along the neuronal network, from sensory stim- 
ulation to motor response We study its somatic ner- 
vous system, comprising 282 neurons that control all ac- 
tivity except the pharyngeal movements. This can lead to 
an understanding of the command and control processes 
occurring at the mesoscopic level that produce specific 
functional responses, including avoidance behavior and 
movement along a chemical gradient. The neuron loca- 
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tions as well as their connections being completely de- 
termined by the genetic program, are invariant across 
individual organisms Q. Further, unlike in higher or- 
ganisms, the connections do not change with time in the 
adult nematode 4]. In combination with the possibility 
of experimenting on the role of single neurons in differ- 
ent functional modalities, these invariances allow one to 
uniquely identify the important neurons in the system 
having specific behavioral tasks. 

The recent developments in the theory of complex 
graphs has made available many analytical tools for 
studying biological networks [!, @ . The initial emphasis 
was on developing gross macroscopic descriptions of such 
systems using measures such as average path length be- 
tween nodes of the network, the clustering among nodes 
and the degree sequences (the number of links per node) . 
However, such global characterizations of systems ignore 
significant local variations in the connection topology 
that are often functionally important. Therefore, investi- 
gating the network at a mesoscopic level which consider 
the broad patterns in the inhomogeneous distribution of 
connections, may reveal vital clues about the working of 
an organism that could be hidden in a global analysis. 
Further, these large-scale features help in understanding 
how coordination and integration occurs across different 
parts of the system, in contrast to a study of microscopic 
patterns comprising only a few neurons, e.g., motifs [7|. 

The existence of modules, marked by the occurrence of 
groups of densely connected nodes with relatively fewer 
connections between these groups Q, provides a natu- 
ral meso- level description of many complex systems 
In biological networks, such modular organization has 
been observed across many length scales, from the intra- 
cellular protein-protein interaction system [l(| ITT| t o 
food webs comprising various species populations [12j . 
The relation between certain modules and specific func- 
tions, e.g., in metabolic networks [l3j], helps us to un- 
derstand how different functions are coordinated in the 
integral performance of complex biological networks. 

Modular organization in the brains of different species 
have been observed, both in functional networks derived 
from EEG/MEG and fMRI experiments and in struc- 
tural networks obtained from tracing anatomical connec- 
tions [lij • Functionally defined networks are constructed 
by considering brain areas, comprising a large number of 
localized neuronal groups, which are linked if they are 
simultaneously active. Such systems have been shown to 
be modular for both human [l^, and non-human R^l 
subjects. Tract-tracing studies in the brains of cat [18[ 
and macaque fl9j have also revealed a modular layout in 
the structural inter-connections between different corti- 
cal areas. However, as neurons are the essential build- 
ing blocks of the nervous system, ideally one would like 
to explore the network of interconnections between these 
most basic elements. In the extremely complicated mam- 
malian brains, it is so far only possible to analyze such 
networks for extremely limited regions that do not give a 
picture of how the system behaves as a whole [Icj . The 



relative simplicity of the nervous system of C. elegans al- 
lows a detailed analysis of the network, defined in terms 
of both electrical (gap junctional) and chemical (synap- 
tic) connections between the neurons (Fig. [T|). 




1 2 3 4 5 6 78 9 10 12 3 4 5 6 78 9 10 

Ganglion Ganglion 



FIG. 1: Neuronal position and connectivity in the so- 
matic nervous system of the nematode C. elegans in- 
dicating the different ganglia, (a) Schematic diagram of 
C. elegans, indicating the different ganglia. (Inset) Schematic 
representation of connectivity between the neurons, parti- 
tioned into a strongly connected component (SCC), an in- 
component (IN), and an out-component (OUT). A directed 
path exists from any neuron in IN to any neuron in OUT 
through neurons in SCC, all of whose members can be reached 
from each other. The large SCC suggests that it is possible 
to transfer signals between almost all neurons of the network. 
The IN and OUT components have only 1.5% and 0.5%, re- 
spectively, of the 279 connected neurons in the somatic ner- 
vous system, (b, c) The connectivity matrix corresponding to 
the (b) Synaptic and (c) Gap-junctional connections between 
the somatic system neurons. In all figures, the partition sym- 
bols correspond to (Gl) Anterior, (G2) Dorsal, (G3) Lateral, 
(G4) Ventral, (G5) Retrovesicular, (G6) Posterolateral, (G7) 
Preanal, (G8) Dorsorectal and (G9) Lumbar ganglion, and 
(G10) the Ventral cord. 

The ubiquity of modularity in brain networks leads to 
the obvious question about how to explain the evolution 
of such a structural organization 21]. One possible rea- 
son for the existence of modular architecture is that it 
may result in low average path length (which is associ- 
ated with high efficiency of signal communication) and 
high clustering (that allows local segregation of informa- 
tion processing) in networks (22] . An alternative possibil- 
ity is that segregation of neurons into spatially localized 
communities minimizes the total cost associated with the 
wiring length (the physical distance spanned by connec- 
tions between neurons). This cost arises from resources 
associated with factors such as wiring volume as well as 
metabolism required for maintenance and propagation of 
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signals across long distances [23j. Developmental con- 
straints, such as the lineage relations between different 
neurons may also play an important role in determining 
the connection topology of the neuronal network [24j • In 
addition, the existence of empirically determined circuits 
responsible for specific functions (such as, movement as- 
sociated with exploratory behavior, egg laying, etc.) in 
the C. elegans nervous system, raises the intriguing pos- 
sibility that structurally defined modules are associated 
with definite functional roles 25] . The invariant neuronal 
connectivity profile of C. elegans allows us to explore the 
contributions of the above mentioned structural, develop- 
mental and functional constraints in governing the meso- 
scopic organization of the nervous system. 

In this paper, we begin our analysis of the organi- 
zation of the C. elegans nervous system by identifying 
structurally defined modules in the network of neurons 
linked by synapses and gap-junctions. Next, we inves- 
tigate whether the observed modular structure can be 
explained by using arguments based on universal prin- 
ciples. Such criteria, which include minimizing the cost 
associated with neuronal connections [23j, 26J and their 
genetic encoding [13], or, decreasing the signal propaga- 
tion path 28, 29], have recently been proposed to explain 
observed patterns of neuronal position and connectivity. 
Complementing the above studies, we seek to understand 
the factors determining the topological arrangement of 
the nervous system, given the physical locations of the 
neurons. We determine the role of physical proximity 
between a pair of neurons in deciding the connection 
structure, by investigating the correlation between their 
spatial positions and their modular membership. We also 
compare these modules with the existing classification of 
the nematode nervous system into several ganglia, as the 
latter have been differentiated in terms of anatomical lo- 
calization of their constituent neurons. Results from the 
above analysis suggests that resource constraints such as 
wiring cost cannot be the sole deciding factor governing 
the observed meso-level organization. We also show that 
the modules cannot be only a result of the common lin- 
eage of their member nodes. 

It is natural to expect that the structure of the nervous 
system is optimized to rapidly process signals from the 
environment so that the organism can take appropriate 
action for its survival [3(J. By looking at the deviation 
between the actual network and a system optimized for 
maximal communication efficiency in conjunction with 
minimum wiring cost, we infer the existence of additional 
functional constraints possibly related to processing of in- 
formation (i.e., other than rapid signal transmission). In- 
formation processing refers to the transformation of sig- 
nals [31(, such as selective suppression of activation in 
specific pathways, which allows different sensory stimuli 
to initiate response in distinct sets of motor neurons. Ab- 
sence of such transformations would result in non-specific 
arousal of a large number of connected neurons, and will 
not achieve the high level of control and coordination 
necessary for performing a variety of specific functional 



tasks. This is also related to the observation of rela- 
tively high clustering in C. elegans neuronal network as 
compared to other information networks (e.g., electronic 
logic circuits (32[)- Previous investigation of the role of 
clustering in the performance of neural network models in 
associative recall of stored patterns, has shown that lower 
clustering exhibits much better performance (33| . It has 
also been shown that the clustering in C. elegans neuronal 
network is higher than that in degree-conserved random- 
ized networks having the same wiring cost [Hj]. There- 
fore, the presence of enhanced clustering in a system that 
has evolved under intense competition for survival may 
imply it plays a key role in processing information. 

This brings us to the possibility that the observed dis- 
tribution of neurons among modules is closely related 
to the behavioral requirements of the organism. For this 
purpose, we investigate the relation between the modules 
and the different functional circuits governing specific be- 
havioral aspects of C. elegans. We identify the functional 
importance of certain key neurons by observing their rel- 
ative connectivity within their module as compared to 
that with neurons belonging to other modules. By look- 
ing at the correlation between local and global connectiv- 
ity profiles of individual neurons, we observe that the ne- 
matode nervous system is different from systems designed 
for rapid signal transfer, including other information net- 
works occurring in the technological domain, such as the 
Internet. Further, in contrast to previous observations on 
the similarity between biological signalling networks hav- 
ing different origins [3J,|35|], we find that the C. elegans 
neuronal network has properties distinct from at least 
one other biological network involved in signalling, the 
protein interaction network (viz., in terms of the assor- 
tativity and the role-to-role connectivity profile) [H, H3] ■ 

Thus, the analysis of the network at the mesoscopic 
level provides an appropriate framework for identifying 
the roles that different classes of constraints (develop- 
mental, structural and functional) play in determining 
the organization of a nervous system. It allows us to 
infer the existence of criteria related to processing of in- 
formation governing the observed modular architecture 
in C. elegans neuronal inter-connections. It also provides 
the means for identifying neurons having key roles in the 
behavioral performance of the organism exclusively from 
anatomical information about their structural connectiv- 
ity. Our results can help experimentalists in focusing 
their attention to a select group of neurons which may 
play a vital part in as yet undetermined functions. 



III. RESULTS 

A. Modular structure of the C. elegans somatic 
nervous system 

We begin our study of the mesoscopic organization of 
the network by focusing on identifying its modular ar- 
rangement. In order to determine the community struc- 



4 



ture of the C. elegans neuronal network, we perform an 
optimal partitioning of the system into modules, that cor- 
responds to the maximum value of modularity parameter, 
Q (see Methods for details on modularity measure Q and 
the algorithm used for modularity determination). We 
have considered different cases corresponding to the dif- 
ferent types of neuronal connections (viz., gap junctions 
and synapses) and the nature of such connections (i.e., 
unweighted or weighted by the number of connections be- 
tween a given pair). While the gap junctional network is 
undirected, the directional nature of signal propagation 
through a synapse implies that the synaptic network is 
directed. For each network, we have obtained using the 
spectral method, the maximum modularity Qm and the 
corresponding number of partitions (Table|l|. Note that, 
the number of modules and their composition is depen- 
dent on the type of connections we consider. 



As we want to consider all connections in our study, 
we have also worked with an aggregate network that in- 
cludes synapses as well as gap junctions. Throughout 
this paper, we have reported results for this weighted 
combined network, unless stated otherwise (see Text SI 
for the analysis of the network of synaptic connections). 
The link weights in the combined network correspond to 
the total number of synaptic and gap junctional connec- 
tions from one neuron to another. The high value of 
Qm and dense inter-connectivity within modules (Fig. [21 
a) suggest that the network has a modular organization 
(Table SI). We further validate our results by calculat- 
ing the modularity of randomized versions of the network 
where the degree of each node is kept fixed (see Methods 
for the network randomization procedure). The average 
modularity of these randomized networks is considerably 
lower than that of the empirical network. We examine 
the robustness of the modular partitioning with respect 
to the method used for detecting communities by using 
another module determination algorithm 38]. However, 
this produces a lower value for Q{— 0.434) compared to 
the spectral method. The three modules identified us- 
ing this alternative method have a substantial overlap 
(normalized mutual information / = 0.496) with the six 
modules obtained using the spectral method. It suggests 
that the alternative algorithm yields a coarser-grained 
picture of the network communities that are considered 
in the rest of the paper. 



The modules do not have a simple relation with the 
anatomical layout of the worm. In particular, they are 
not a result of a simple division of the nervous system 
into groups responsible for receiving sensory input, and 
other groups involved in motor output. In Fig. [5](b), we 
have analyzed the composition of the different modules in 
terms of distinct neuron types (viz. sensory, motor and 
inter-neurons). None of the modules are exclusively com- 
posed of a single type of neuron, although motor neurons 
do tend to dominate one module. 




FIG. 2: Modular interconnectivity and decomposition 
according to neuron type, (a) Matrix representing the 
average connection density between neurons occurring within 
modules and those in different modules. The figure indicates 
that neurons within a module are densely interconnected com- 
pared to the overall connectivity in the network, (b) The 
modules are decomposed according to the different neuron 
types comprising them. The figure shows that the modules 
are not simply composed of a single type of neuron. 



B. Modules and spatial localization 

To understand why modular structures occur in the 
neuronal network, we first consider the relation between 
the optimal partitions and the spatial localization of neu- 
rons in each module. This can tell us whether constraints 
related to the physical nearness between neurons, such as 
minimization of the wiring length, dictate the topologi- 
cal organization of the network. Wiring cost has already 
been shown to be the decisive factor governing neuron po- 
sitions in the body of C. elegans [23, 26]. Thus, a plausi- 
ble hypothesis is that, if most neuronal connections occur 
within a group of neurons which are physically adjacent 
to each other, then the wiring cost will be significantly 
decreased. In terms of connectivity, this will be mani- 
fested as a modular organization of the network, where 
each module will mostly comprise neurons in close phys- 
ical proximity. 

Fig. [3] indicates the spatial location of the cell body 
for each neuron on the nematode body (along the longi- 
tudinal axis), segregated according to their membership 
in different modules. We see that a large fraction of the 
neurons belonging to the same module do indeed have 
their cell bodies close to each other. A one-way ANOVA 
test, comparing the positions of the neurons in the dif- 
ferent modules with the null hypothesis that they are 
drawn from the same population, shows that it can be 
rejected at confidence level of 99.9%. However, the large 
standard deviations for the distribution of positions of 
the module components reveal that none of the modules 
are spatially localized at any specific region on the ne- 
matode body axis. A more detailed multiple comparison 
procedure carried out for every pair of modules shows the 
absence of statistically significant spatial separation for 
the module pairs (I, III), (I, V), (II, VI), (III, IV), (III, 
V), (III, VI) and (IV, VI). The lack of significant dis- 
tinction between the modules in terms of their physical 
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TABLE I: Modularity of the C. elegans neuronal network. 
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The modularity of the network is measured using the parameter Q, which requires a knowledge of the partitions or 
communities which divide the network. We obtain the modularity measure, Q g , on assuming the communities to correspond 
to the ganglia. Its positive values indicate that neurons in the same ganglion have high density of inter-connections. We have 
also obtained Q by determining the modules of the network using a spectral method, the corresponding values being indicated 
by Qm- The relatively high values of Qm compared to Q g , indicates that the ganglia do not match with this optimal 
partitioning of the network. The measures, Q g and Qm, as well as the number of modules, um, have been obtained for both 
unweighted and weighted networks consisting of either gap junctions or synapses or both. We calculate the overlap between 
the ganglionic and the optimal partition of the network using the normalized mutual information index, /. For the case of 
perfect match between the two, the index, 1=1, whereas if they are independent of each other, 1 = 0. The measured values 
of / indicate that the overlap between the different modules and the anatomically defined ganglia is not high. The modular 
nature of the somatic nervous system is emphasised by comparing the empirical network with networks obtained by 
randomizing the connections, keeping the degree of each neuron fixed. The mean and standard deviation of the modularity 
Q™ d and the corresponding number of partitions m™ d are shown for both weighted and unweighted networks, and for the 
different types of connections. For all cases, the randomized networks show a significantly lower modularity than the 
empirical network. 
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FIG. 3: Neuronal layout of the worm indicating cell 
body positions of each neuron. The position of neuronal 
cell bodies along the longitudinal axis of the C. elegans body 
plan is shown, with the vertical offset and color indicating 
the module to which a neuron belongs. The mean and stan- 
dard deviation of neuronal positions for each module is also 
indicated, suggesting relative absence of spatial localization 
in the modules. 

location weighs against the hypothesis of wiring length 
minimization being the dominant factor governing the 
connectivity. 

The above conclusion is further supported by analysing 
the connectivity pattern of the different ganglia of the 
nematode nervous system. The nine anatomically de- 
fined ganglia (Gl: Anterior, G2: Dorsal, G3: Lateral, 
G4: Ventral, G5: Retrovesicular, G6: Posterolateral, G7: 
Preanal, G8: Dorsorectal and G9: Lumbar), in addition 
to the ventral cord (G10), are defined in terms of physi- 
cal proximity of their component neurons. Thus, a lower 
total connection length between neurons would result in 
the ganglia having a relatively higher density of connec- 
tions between their constituent neurons. This would im- 
ply that the existence of ganglia imposes a modular struc- 



ture in the connection topology. To examine how well the 
ganglionic arrangement explains the observed modularity 
of the neuronal network, we have measured the modular- 
ity value Q g where the network communities correspond 
to the different ganglia. Although the non-zero value 
of Q g indicates that the connection density between the 
neurons in a ganglion is higher than that for the overall 
network, it is not as high as the maximum possible value 
of Q (= Qm, obtained for the optimal partitioning) as 
seen from Table |U To measure the overlap between the 
modules obtained by optimal partitioning of the network 
and the ganglia, we calculate the normalized mutual in- 
formation index, I (see Methods). In the case of perfect 
match between the two, 7 = 1, while, if there is no over- 
lap, I — 0. The low values for I given in Table Q] suggest 
that the composition of the different ganglia is quite dis- 
tinct from that of the modules. The overlap between the 
modules and the ganglia is shown explicitly in Fig. [4] (a) , 
indicating that most ganglia are composed of neurons 
belonging to many different modules. 

This distribution of the neurons of each ganglion into 
the m different modules of the optimal partition allows 
us to define a modular decomposition spectrum for the 
different ganglia. It gives us a metric for inter-ganglionic 
distance in a m-dimensional "modular" space. Thus, in 
this abstract space, two ganglia are close to each other if 
they have similar spectral profiles. Their distance in this 
"modular" space (Fig. [4] b) are then compared to their 
physical distance, as measured in terms of the average 
separation between the cell bodies of all pairs of neu- 
rons i and j, belonging to different ganglia (Fig. 21 c). 
The comparison of the two distance matrices shows that 
there are indeed certain similarities between these two 
different concepts of closeness between the ganglia. For 
example, the five ganglia located in the head (G1-G5) 
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FIG. 4: Modular decomposition of neurons in differ- 
ent ganglia, (a) Neurons belonging to different ganglia are 
decomposed according to their modular membership. The 
height of each bar in the histogram corresponds to the over- 
lap between the ganglia and the modules, calculated as the 
fraction of neurons that are common to a particular ganglion 
and a specific module, (b) The matrix representing the aver- 
age modular distance between the different ganglia, as calcu- 
lated from the modular decomposition spectrum of each gan- 
glion. The corresponding dendrogram indicates the closeness 
between different ganglia in the abstract 6-dimensional "mod- 
ular" space, (c) The matrix of physical distances between the 
ganglia is shown for comparison with (b), calculated as the 
average distance between neurons belonging to the different 
ganglia. The corresponding dendrogram indicates the close- 
ness between ganglia according to the geographical nearness 
of their constituent neurons in the nematode body. The dif- 
ference indicates that the ganglia which are geographically 
close may not be neighbors in terms of their modular spectra. 



Ganglia which are close to each other in physical space 
may not be neighbors in terms of their modular spec- 
tra. For instance, G5 which is located in the head, is 
closer in "modular" space to the ganglion G8 located in 
the tail. On computing the correlation coefficient be- 
tween the two trees by considering the distances between 
every pair of ganglia (measured as the path length be- 
tween the pair in the dendrogram), we obtain r = 0.516 
(p < 0.0003). This value is substantially lower than 1, 
the value expected had the two dendrograms been identi- 
cal. It reiterates our previous conclusion that wiring cost 
minimization, which is related to the physical distance 
between neurons, is not a dominant factor governing the 
organization of C. elegans somatic nervous system. 



C. Modules and cell lineage 

As developmental processes are believed to play a crit- 
ical role in determining the structure of the nervous sys- 
tem, we also consider the alternative hypothesis that the 
structural modules reflect a clustering of neurons that 
are related in terms of their lineage. Lineage of a cell 
is the pattern of successive cellular divisions that occur 
during its development. This is invariant in C. elegans, 
allowing one to trace the individual developmental his- 
tory of each cell in order to identify the cell-autonomous 
mechanisms and inter-cellular interactions that define its 
fate [3j . We investigate whether a relation exists between 
the modular structure and the sequence of cell divisions 
that occur during development, by measuring the aver- 
age relatedness between neurons occurring in the same 
module and comparing with that for neurons occurring 
in different modules. 
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cluster together, as do the three located towards the tail 
(G7-G9). This similarity can be quantified by comput- 
ing the correlation between these two distance measures 
(i.e., Euclidean and modular), r = 0.564 (p < 0.0001). 
Our observation is in accord with previous reports which 
use the notion of wiring cost for explaining (to a cer- 
tain extent) the observed relative positions of the gan- 
glia [39|]. However, when we consider the correspond- 
ing dendrograms that indicate the relative proximity of 
the different ganglia in physical space and in "modular" 
space, we observe significant differences between the two. 



FIG. 5: Lineage distance between modules. The ma- 
trix representing the average lineage distance between neurons 
occurring within the same module and those belonging to dif- 
ferent modules. The figure indicates that neurons occurring 
in the same module have only a slightly lower lineage distance 
as compared to that between neurons occurring in different 
modules. 

Fig. [5] indicates that it is difficult to distinguish the 
modules in terms of the lineage of the neurons com- 
prising them. Indeed, even coarse distinctions such as 
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AB and non-AB lineage neurons are not apparent from 
the modular division. A one-way AN OVA test, with the 
null hypothesis that the average lineage distance between 
neurons within the same module and that between neu- 
rons belonging to different modules are obtained from 
the same distribution, shows that it cannot be rejected 
at confidence level of 99%. We next analyse each pair 
of modules using a multiple comparison procedure for 
the intra-modular and inter-modular lineage distances of 
their constituent neurons. This reveals that the neurons 
belonging to modules III and VI have within-module lin- 
eage distance distribution that is statistically indistin- 
guishable from the distribution of their lineage distance 
with neurons belonging to any of the other modules. 
Thus, for at least two of the modules, one cannot seg- 
regate them in terms of cell lineage. The detailed view 
of the relatedness between each pair of neurons shown 
in Fig. SI indicates that, while in each module there 
are subgroups of closely related neurons, different sub- 
groups within the same module may be very far from 
each other in the lineage tree. Conversely, neurons oc- 
curring in different modules can have small distance be- 
tween each other in terms of lineage. This observation is 
supported by the low correlation r = 0.076 (p < 0.0001) 
between the physical and lineage distances of neurons. 
The fact that C. elegans neurons are largely non-clonally 
derived from many different parent cells |40( may partly 
explain this lack of correlation between lineage and mod- 
ules. The above results indicate that developmental con- 
straints arising from common ancestry are not exclusively 
responsible for the observed connection structure of the 
C. elegans neuronal network. 



D. Optimizing between wiring cost and 
communication efficiency 
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FIG. 6: Trade-off between wiring cost and communi- 
cation efficiency in the network. The variation of com- 
munication efficiency, E, as a function of the wiring cost, de- 
fined using either the "dedicated-wire" model (DW) or the 
"common- wire" model (CW), in the ensemble of random net- 
works with degree sequence identical to the C. elegans neu- 
ronal network. The trend indicates a trade-off between in- 
creasing communication efficiency and decreasing wiring cost. 
The corresponding values for the empirical network are indi- 
cated by crosses for both DW and CW. The schematic figures 
shown above the main panel indicate the type of networks 
obtained in the limiting cases when only one of the two con- 
straints are satisfied. In both curves, error bars indicate the 
standard deviations calculated for 10 3 random realizations. 
We observe that the empirical network is suboptimal in terms 
of wiring cost and communication efficiency, suggesting the 
presence of other constraints governing the network organiza- 
tion. 



In the previous section, we have shown that neither 
wiring cost minimization nor lineage considerations can 
by themselves determine the connection topology of the 
network. In order to ascertain the possible nature of 
the additional constraints that gives rise to the observed 
mesoscopic structure, we now investigate global proper- 
ties of the neuronal network. A possible governing factor 
for network organization is that, rather than decreasing 
the total wiring length or the average physical distance 
between connected neurons, the network minimizes the 
path length for information transfer. This can be mea- 
sured by the number of links that must be traversed to go 
from one neuron to another using the shortest route [29( . 
We consider this possibility by measuring the communi- 
cation efficiency of the network, using the harmonic aver- 
age path length between all pairs of neurons (see Meth- 
ods). 

It is evident that increasing the efficiency requires 
topological long-range connections, which however in- 
creases the wiring cost of the network (Fig.|5]top). There- 
fore, it is natural to expect that the system would try to 



optimize between these two constraints. Thus, we com- 
pare the performance of the network as a rapid signal 
propagation system against the resource cost for the re- 
quired number of connections. This cost is measured as 
the Euclidean length between the cell bodies of all con- 
nected pairs of neurons, corresponding to the "dedicated- 
wire" model of Ref. [23]. It has been shown that the posi- 
tions of the subset of sensory and motor neurons directly 
connected to sensory organs and muscles, respectively, 
can be determine d q uite accurately by minimizing their 
total wiring cost [26|. As our focus is on the connection 
structure of the neuronal network, we keep the neuron 
positions invariant. By randomizing the network, keep- 
ing the degree of each node unchanged, we can construct 
a system with a specific wiring cost. We then measure its 
communication efficiency Fig. [^reproduces the expected 
result that, decreasing the wiring cost of the network 
causes a decline in its performance in terms of its abil- 
ity to propagate signals rapidly. However, it is surprising 
that the empirical network has a wiring cost much higher 
than that of the corresponding randomized network hav- 
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ing the same communication efficiency. To see whether 
this couid be an artifact of the measure used to calculate 
wiring cost, we have considered an alternative method 
for quantifying it. 

Most of the neurons in C. elegans have at most one or 
two extended processes, on which all the synapses and 
gap junctions with other neurons are made. Thus the 
"dedicated- wire" definition of wiring cost that sums Eu- 
clidean distances between every connected neuronal pair 
may be a gross over-estimate of the actual usage of re- 
sources used in wiring. Instead, we can use a "common- 
wire" model to define the wiring cost for connecting to 
a specific neuron. This is measured by the Euclidean 
distance between the neuron's cell body and that of the 
farthest neuron (along the longitudinal axis) it is con- 
nected to. The simple one-dimensional simplification of 
the C. elegans body that we have assumed here ignores 
distance along the transverse plane. Thus, this mea- 
sure is actually an under-estimate of the actual wiring 
cost, and should provide an insightful comparison with 
the above measure obtained from the "dedicated-wire" 
model. Fig. [5] (inset) shows that wiring cost increases 
with communication efficiency for the randomized net- 
works, which is qualitatively similar to the relation ob- 
tained using the preceding definition for wiring cost. In 
this case also, we find that the empirical C. elegans net- 
work has a much lower efficiency in comparison with an 
equivalent randomized network having the same wiring 
cost. Thus, as this observation is independent of these 
two definitions of wiring cost, it suggests the presence of 
other constraints that force the neuronal network to have 
a higher wiring cost or lower efficiency than we would 
have expected. These constraints are possibly related to 
information processing, which is the principal function of 
the nervous system. 



E. Information processing is a distinctive feature of 
the C. elegans neuronal network 

To explore further the possibility that the additional 
constraints governing the topological structure of C. ele- 
gans nervous system may be related to information pro- 
cessing, we investigate how this functional requirement 
could be responsible for differentiating the system from 
other complex networks for which communication effi- 
ciency is of paramount importance. Rapid communica- 
tion of information between different neurons is certainly 
an important performance criterion. However, the neu- 
ronal network has properties quite distinct from that of 
(say) the Internet or the airline transportation network, 
which are systems designed for maximum transportation 
efficiency of signals or physical resources. 

For this purpose, we look at the overall network de- 
sign by decomposing the system into (i) a strongly con- 
nected component (SCC), within which it is possible to 
visit any node from any other node using directed links, 
(ii) an inward component (IN) and (iii) an outward corn- 
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FIG. 7: Betweenness centrality and the average near- 
est neighbor degree as a function of the total degree of 
network, (a) The average betweenness centrality, (BC), and 
(b) the average nearest neighbor degree, (k nn ) of each node as 
a function of its total degree, (k) — (fe n + fc ou t)- Betweenness 
centrality is a measure of how frequently a particular node is 
used when a signal is being sent between any pair of nodes 
in the network using the shortest path. In case of the Inter- 
net, BC of nodes increases with its degree which is sought 
to be linked with its information transport property. In C. 
elegans, although BC increases with degree, this increase is 
not significant when compared to the randomized version of 
the network. In the case of the relation between the aver- 
age connectivity of nearest neighbors of a node with its total 
degree k, we note that for both the Internet and protein in- 
teraction network, k nn decreases with k as a power law. This 
means that low connectivity nodes have high degree nodes as 
their neighbors and vice- versa. However, in the case of C. 
elegans, this relation is not very apparent and insignificant 
in comparison with the randomized version of the network. 
In both figures, error bars indicate the standard deviations 
calculated for 10 3 random realizations. These results suggest 
that the C. elegans network forms a class distinct from the 
class of networks optimized only for signal propagation. 

ponent (OUT), consisting of nodes from which the SCC 
can be visited or which can be visited from the SCC, 
respectively, but not vice versa. In addition, there can 
be disconnected components, i.e., nodes which cannot be 
visited from SCC nor can any visits be made to SCC from 
there (Fig. [TJ. A comparison of the C. elegans neuronal 
network with a similar decomposition of the WWW [4l[ 
reveals that, while in the latter the different components 
are approximately of equal size (WWW: SCC ~ 56 mil- 
lion pages, while IN, OUT consist of ~ 43 million pages 
each), the SCC of the nervous system comprises almost 
the entire network (SCC has 274 neurons, IN has 4 neu- 
rons and OUT has 1 neuron). Thus, any node can, in 
principle, affect any other node in the nervous system, 
suggesting the importance of feedback control for infor- 
mation processing. 

Next, we consider the relation between two fundamen- 
tal properties of the network: the degree of nodes and 
their betweenness centrality (BC), which characterizes the 
importance of a node in information propagation over 
the network (see Methods). We observe that for both 
the C. elegans neuronal network and its randomized ver- 
sions, the degree of a node and its BC are strongly cor- 
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related, i.e., highly connected nodes are also the most 
central (Fig. left). This is similar to what has been 
observed in the Internet [13], where the highest degree 
nodes are also those with the highest betweenness |43j . 
but in sharp contrast to the airport transportation net- 
work, where non-hub nodes (low degree) may have very 
large BC Q. 

However, the C. elegans neuronal network differs from 
other networks whose primary function is to allow sig- 
nal propagation between nodes (viz., the Internet and 
the protein interaction network (PIN)), in terms of the 
variation of the degree of a node with the average degree 
of its neighboring nodes, (k nn ). While in the Internet 
and PIN, (k nn ) decays as a power law with node de- 
gree, in the neuronal network, this dependence is very 
weak (Fig[jj right), especially when contrasted with the 
randomized ensemble. This implies that the C. elegans 
nervous system does not have multiple star-like subnet- 
works as seen in the Internet and PIN (37}. Further, it is 
different from the airline transportation network, where 
the high degree nodes are closely connected among them- 
selves showing an assortative behavior (45[. In fact, com- 
putation of the assortativity coefficient r = —0.093 in- 
dicates that the network is disassortative (as previously 
stated in Refs. @, E3|): although comparison with that 
of the degree-conserved randomized ensemble {r ran d = 
—0.078 ± 0.011) indicates that this is predominantly a 
result of the degree sequence. Another important distin- 
guishing characteristic of the C. elegans network is that 
the distributions of link weight and degree do not appear 
to be scale- free, or even having a long tail (Fig. S2), unlike 
systems such as the Internet and the airline transporta- 
tion network [45|,|48j]. Thus, our study shows that there 
are additional constraints governing the nervous system 
connection topology in C. elegans, which are unrelated to 
wiring cost, lineage or communication efficiency. As the 
principal function of the system is to process information, 
the above results suggest it is this functional requirement 
that provides the additional constraints leading to the ob- 
served organization of the nematode neuronal network. 



F. Modules and functional circuits 

In order to understand the nature of functional consid- 
erations that may govern the network organization, we 
focus on the overlap of several previously identified func- 
tional circuits of C. elegans with the structurally identi- 
fied modules. Functional circuits are a subset of neurons 
which are believed to play a vital role in performing a 
function, and are distinguished by observing abnormal 
behavior of the organism when they are individually re- 
moved (e.g., by laser ablation). In biological systems, 
it has been observed that members of structurally de- 
fined communities are often functionally related (e.g., in 
the intra-cellular protein interaction network [l3j and the 
network of cortical areas in the brain (4!|[5(|). Here, we 
investigate the possibility of a similar correlation between 



the anatomical modules of C. elegans and its functional 
circuits. We consider the functional circuits for (Fl) 
mechanosensation [HHi^, (F2) egg layingpl [H[ , (F3) 
thermotax is |56l| . (F4) chemosensation |57|. (F5) feed- 
ing @, [H il7(F6) exploration @, HJ El] and (F7) tap 
withdrawal^!!,!!! (Table S2). 
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FIG. 8: Modular decomposition of neurons in dif- 
ferent functional circuits. Neurons belonging to different 
functional circuits are decomposed according to their modular 
membership. The height of each bar in the histogram corre- 
sponds to the overlap between the modules and functional 
circuits (Fl) mechanosensation, (F2) egg laying, (F3) ther- 
motaxis, (F4) chemosensation, (F5) feeding, (F6) exploration 
and (F7) tap withdrawal. The overlap is measured in terms 
of the fraction of neurons common to a particular functional 
circuit and a specific module. The corresponding dendrogram 
represents the closeness between different functional circuits 
in the abstract 6-dimensional "modular" space. 

Fig. [3] shows the modular decomposition of each func- 
tional circuit, indicating their overlap with the modules. 
The corresponding dendrogram clusters the circuits in 
terms of the similarity in their modular spectra. We note 
that the circuits for chemosensation, feeding and explo- 
ration are clustered together. This is consistent with the 
fact that most of the neurons belonging to the feeding 
and exploration circuits are involved in chemosensation. 
A surprising observation is that although F6 is a subset 
of F4, it is actually closer to F5 in "modular" space (dis- 
tance = 0.18) than to F4 (distance = 0.26), despite the 
feeding and exploration circuits not having any neuron in 
common. This close relation between F5 and F6 in mod- 
ular space is suggestive of a relation between modularity 
and functionality, as it is known from experiments that 
there is a strong connection between their corresponding 
functions. The feeding behavior of C. elegans is known to 
be regulated in a context-dependent manner by its chem- 
ical milieu. It integrates external signals [60| , such as the 
availability of food, and nutritional status of the animal, 
to direct an appropriate response [6lj |. An example is 
the avoidance of high CO2 concentrations by satiated 
animals (62|. Further, the mode of locomotion of the 
organism is also determined by the quality of food [o3j . 
Another important observation made from the modular 
decomposition is the proximity of the functional circuit 
F2 to the group (F4,F5,F6). This is significant in light of 
experimental observations that presence of food detected 
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through chemosensory neurons modulates the egg-laying 
rate in C. elegans jH, [64{ . The above results indicate 
that the relation between the functional circuits, which 
are essential for the survival of the organism, are reflected 
in the modular organization of the nematode nervous sys- 
tem. 



nodes have higher participation coefficient, with a large 
majority being satellite connectors (R3). More interest- 
ing is the fact that, the empirical neural network does not 
possess any neuron having the global roles played by R4 
and R7, whereas these regions may be populated in ran- 
domized networks. This implies that modular identity in 
the C. elegans neuronal network is very pronounced. 



G. Functional roles of different neurons 

Having looked at the functional circuits and the rela- 
tions between them in the previous section, we now in- 
vestigate the importance of individual neurons in terms 
of their connectivity. This is revealed by a comparison 
between the localization of their connections within their 
own community and their global connectivity profile over 
the entire network. In order to do this, we focus on (i) 
the degree of a node within its module, z, that indicates 
the number of connections a node has to other members 
of its module, and (ii) its participation coefficient, P, 
which measures how dispersed the connections of a node 
are among the different modules 37]. 

A node having low within-module degree is called a 
non-hub (z < 0.7) which can be further classified ac- 
cording to their fraction of connections with other mod- 
ules. Following Ref. [13], these are classified as (Rl) 
ultra-peripheral nodes (P < 0.05), having connections 
only within their module, (R2) peripheral nodes (0.05 < 
P < 0.62), which have a majority of their links within 
their module, (R3) satellite connectors (0.62 < P < 0.8), 
with many links connecting nodes outside their modules, 
and (R4) kinless nodes (P > 0.8), which form links uni- 
formly across the network. Hubs, i.e., nodes having rel- 
atively large number of connections within their module 
(z > 0.7), are also divided according to their participa- 
tion coefficient into (R5) provincial hubs (P < 0.3), with 
most connections within their module, (R6) connector 
hubs (0.3 < P < 0.75), with a significant fraction of links 
distributed among many modules, and (R7) global hubs 
(P > 0.75), which connect homogeneously to all mod- 
ules. This classification allows us to distinguish nodes 
according to their different roles as brought out by their 
intra-modular and inter-modular connectivity patterns 
(Table S3). 

We will now use the above methodology on the C. el- 
egans network in order to identify neurons that play a 
vital role in coordinating activity through sharing infor- 
mation (either locally within their community or globally 
over the entire network). Fig. |H] shows the comparison 
between the empirical network and a corresponding ran- 
domized network (obtained by keeping the degree of each 
node fixed). Results for a randomized ensemble, com- 
paring the number of neurons in each role against that 
for the empirical network, are given in Table S4. We 
immediately notice that the randomized networks have 
relatively very few nodes having the roles Rl and R5, in- 
dicating that the modular nature of the original network 
has been lost. In fact, in the randomized system, most 
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FIG. 9: The role of individual neurons according to 
their intra- and inter-modular connectivity, (a) The 

within module degree z-score of each neuron in the empirical 
neuronal network is shown against the corresponding partici- 
pation coefficient P. The within module degree measures the 
connectivity of a node to other nodes within its own module, 
while the participation coefficient measures its connectivity 
with neurons in the entire network, (b) The corresponding 
result for a randomized version of the C. elegans network 
where the degree of each neuron is kept unchanged is also 
shown. Neurons belonging to the different regions in the P — z 
space are categorised as: (gray) "ultraperipheral nodes," i.e., 
nodes with all their links within their module, (blue) "pe- 
ripheral nodes," i.e., nodes with most links within their mod- 
ule, (pink) "nonhub connector nodes," i.e., nodes with many 
links to other modules, (green) "nonhub kinless nodes," i.e., 
nodes with links homogeneously distributed among all mod- 
ules, (yellow) "provincial hubs," i.e., hub nodes with the vast 
majority of links within their module, (red) "connector hubs," 
i.e., hubs with many links to most of the other modules, and 
(white) "global hubs," i.e., hubs with links homogeneously 
distributed among all modules. The neurons occurring as con- 
nector hubs are identified in the figure. Most of these neurons 
occur in different functional circuits indicating the close rela- 
tion between functional importance and connectivity pattern 
of individual neurons. In addition, the neurons AVKL and 
SMBVL which are predicted to be functionally important are 
separately marked. 

It is possible that, neurons having the role of provin- 
cial hubs may be involved in local coordination of neural 
activity, while, the connector hubs may be responsible 
for integration of local excitations to produce a coher- 
ent response of the entire system. This hypothesis is 
supported by noting that all command interneurons (of 
the class AVA, AVB, AVD, AVE, PVC), which control 
forward and backward locomotion of the worm by reg- 
ulating motor output, play the role of connector hubs. 
In fact, out of the 23 neurons in the class R6, 20 are 
known to belong to different functional circuits. Among 
the rest, although DVA does not belong to any of the 



11 



known circuits, it has recently been identified as being 
involved in mechanosensory response and in its absence, 
the frequency and magnitude of the tap-induced reversal, 
as well as the acceleration magnitude, is diminished [59| . 
The two remaining neurons, AVKL and SMBVL, have 
not been implicated so far in any known functional cir- 
cuit. However, their occurrence in this class suggests 
that they may be important for some, as yet unknown, 
function. This is a potentially interesting prediction that 
may be verified in the laboratory. 

The significance of these results is underlined by a com- 
parison with the randomized network. For instance, in 
the random realization shown in Fig. |H] (b), of the 49 
neurons playing the role of connector or global hubs, 
less than half (viz., 23) actually belong to any of the 
known functional circuits. The appearance of most of the 
command interneurons in the high-z region of both the 
empirical and randomized networks indicates that their 
high overall degree is responsible for their observed role 
of "connecting hubs" . 

We now turn to the 28 neurons which play the role of 
provincial hubs. Half of all the inhibitory D-class mo- 
torneurons (viz., DD1-DD3 and VD1-VD6) are found to 
belong to this class. This is significant as these neurons 
have already been implicated in the ability of the worm 
to initiate backward motion. While they also contribute 
to forward locomotion, previous experiments have shown 
that they are not essential 65[. This fits with our hy- 



pothesis that, R5 neurons are important for local coor- 
dination but may not be crucial for the global integra- 
tion of activity. A pair of excitatory B-class motorneu- 
rons that sustain coordinated forward locomotion in the 
worm also appear as provincial hubs. Of the remaining 
R5 neurons, 9 have been previously identified as belong- 
ing to various functional circuits. It will be interesting 
to verify the functional relevance of the remaining 8 neu- 
rons (OLLL/R, RMDVL/R, SMDVR, RIH, RMDDL/R) 
in the laboratory. Thus, overall, we find a very good cor- 
relation between the connectivity pattern and the func- 
tional importance of different neurons. 

Analysis of neurons having different roles in terms of 
their membership in the different ganglia (Fig. S3) indi- 
cates that the lateral ganglion provides the majority of 
neurons acting as connector hubs (R6). This is consis- 
tent with an earlier study where this ganglion was found 
to be the principal highway for information flowing be- 
tween neurons responsible for receiving sensory stimuli 
and those involved in motor response [66j . To check the 
significance of the above result, we observe the member- 
ship of the set of connector hubs in the corresponding 
randomized network (Fig. S3). While this also shows 
many neurons from the lateral ganglion, unlike in the 
empirical network it has significant representation from 
other ganglia too (e.g., the retrovesicular ganglion). 

We have also carried out an analysis of the frequency 
of links between neurons having different roles, relative 
to the randomized network (Fig. S4). This allows us 
to compare the C. elegans neuronal network with other 



networks involved in (a) transportation and (b) informa- 
tion propagation. It has been shown that networks of 
class (b) shows significant under-representation of links 
between Rl-Rl, R5-R6 and R6-R6, whereas networks 
of class (a) exhibit over- representation of all three [37| . 
These patterns have been related to the occurrence of 
stringy periphery in class (a) and multi-star structures 
in class (b) [37[. In the case of C. elegans, Rl-Rl does 
seem to be over-represented. However, R6-R6 shows very 
little over-representation, while both R5-R6 and R6-R5 
show slight under-representation. This difference in the 
role-to-role connectivity pattern for the nematode ner- 
vous system with the networks in the above-mentioned 
two classes suggests that its structure is not exclusively 
characterized by either a stringy periphery or multiple 
stars. This assumes significance in light of recent work 
distinguishing information (or signalling) networks, such 
as the Internet and protein interactome, on the one hand, 
and transportation networks, such as metabolic and air- 
port networks, on the other, into two classes [37| . Our 
results suggest that neuronal networks which have to pro- 
cess information, in addition to transferring signals, may 
constitute a different category from either of the above 
classes. 



IV. DISCUSSION 

In this paper, we have carried out a detailed analysis 
of the mesoscopic structure in the connection topology of 
the C. elegans neuronal network. Inferring the organizing 
principles underlying the network may give us an under- 
standing of the way in which an organism makes sense of 
the external world. We have focused primarily on the ex- 
istence of modules, i.e., groups of neurons having higher 
connection density among themselves than with neurons 
in other groups. The presence of such mesoscopic orga- 
nization naturally prompts us to ask the reasons behind 
the evolution of these features in the network. 

In lower invertebrates like nematodes, the genome is 
the dominant factor which governs the development of 
the organism, including its nervous system. The neu- 
ronal network structure is formed early in the life-cycle 
of the organism, when most of the cells and their con- 
nections are configured permanently. Although external 
cues may play a role, the relative absence of individual 
variations in the network organization makes C. elegans 
an ideal system for studying how the system has evolved 
to optimize for various constraints, such as minimizing 
resource use and maximizing performance. 

There have been recent attempts at explaining neu- 
ronal position and structural layout of the network by 
using static constraints, such as wiring economy and com- 
munication path minimization. Although we find that 
membership of neurons in specific modules are corre- 
lated with their physical nearness, the empirical network 
is sub-optimal in terms of both the above-mentioned con- 
straints. By comparing the system with other complex 
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networks that have been either designed or have evolved 
for rapid transportation while being subject to wiring 
economy, we find that the C. elegans nervous system 
stands apart as a distinct class. This suggests that the 
principal function of neuronal networks, viz., the pro- 
cessing of information, distinguishes it from the other 
networks considered, and plays a vital role in govern- 
ing its arrangement. Considering the importance of this 
constraint in ensuring the survival of an organism, it is 
natural that this should be key to the organizing princi- 
ples underlying the design of the network. The intimate 
relation between function and structure of the nervous 
system is further brought out by our use of structural 
analysis to distinguish neurons that are critical for the 
survival of the organism. In addition to identifying neu- 
rons that have been already empirically implicated in 
different functions (which serve as a verification of our 
method), we also predict several neurons which can be 
potentially crucial for certain, as yet unidentified, func- 
tions. 

Biological systems are distinguished by the occurrence 
of discrete entities with a distinct function, which are 
often termed as functional modules [9( . For several net- 
works that occur in the biological context (such as that of 
protein interactions) , the components of structural mod- 
ules are seen to be functionally related. This suggests 
that modules provide a framework for relating meso- 
scopic patterns in the connection topology to subsystems 
responsible for specific functions. Although there is no 
unique correspondence between the structural modules of 
C. elegans and the known functional circuits [f57j], we use 
the overlap of the circuits with the modular membership 
of their constituent neurons to discover correlations be- 
tween them. Our results reveal non-trivial association be- 
tween circuits whose corresponding functions are closely 
connected, even when they do not share any common 
neurons. As such relations could not have been revealed 
by a micro-scale study which focuses on individual neu- 
rons and their connections, this result highlights one of 
the significant advantages of investigating the network at 
the mesoscopic level. 

When we compare the nervous system of C. elegans 
with the brains of higher organisms, we observe the mod- 
ular organization of the latter to be more prominent [68j j . 
For example, the network of cortical areas in the cat 
and macaque brains exhibit distinct modules [22l . [69| . 
with each module being identified with specific func- 
tions [HI, H3] • A possible reason for the relatively weak 
modular structure in the nematode could be due to the 
existence of extended processes for the neurons of C. ele- 
gans. Many of these span almost the entire body length, 
an effect that is enhanced by the approximately linear 
nature of the nematode body plan. As a result, con- 
nections are not constrained by the physical distance 
between soma of the neurons, as is mostly the case in 
mammalian brains. It is apparent that such constraints 
on the geographical distance spanned by links between 
nodes (viz., cost of wiring length) can give rise to cluster- 



ing of connections among physically adjacent elements. 
In addition, the small nervous system of C. elegans, com- 
prising only 302 neurons, lacks redundancy. Therefore, 
individual neurons may often have to perform a set of 
tasks which in higher organisms are performed by sev- 
eral different neurons. Thus, functional modularity is 
less prominent in the nematode, as some neurons belong 
to multiple behavioral circuits. 

Another principal distinction between the C. elegans 
nervous system and the brains of higher organisms such 
as human beings, is the relative high connectivity in the 
former (the connection density being C ~ 0.1). By 
contrast, the connectance for human brain is around 
10~ 6 [70], which leads us to the question of how com- 
munication efficiency can remain high in such a sparsely 
connected network. It is possible that the more intricate 
hierarchical and modular structures seen in the brains 
of higher organisms is a response to the above problem. 
The fact that the rate at which the number of neurons 
N increase across species, is not matched by a corre- 
sponding increase in the number of links (which increases 
slower than N 2 ) implies the existence of constraints on 
the latter, which is a resource cost in addition to the ear- 
lier mentioned cost of wiring length. Note that, in the 
present work we have focused on a single level of mod- 
ular decomposition of the nematode neuronal network. 
It is possible that the system may have multiple levels 
of hierarchically arranged inter- nested modules [lH [7l| . 
Investigating the existence of such organization in the 
C. elegans nervous system is a potential topic for future 
exploration. 

Networks provide the scaffolding for the computational 
architectures that mediate cognitive functions. The pat- 
tern of connections of a neuron (or a neuronal cluster) 
defines its functionality not just locally but also as an 
integrated part of the nervous system. This is because 
neurocognitive networks across the evolutionary tree con- 
sist of interconnected clusters of neurons that are simul- 
taneously activated for generating a single or a series of 
cognitive outputs. However, while some of the neurons 
(or clusters of neurons) are essential for the relevant out- 
come, others are ancillary. Thus, they work in a collab- 
orative mode but are not interchangeable, each display- 
ing relative specializations for separate behavioral com- 
ponents. Since the most prominent neuronal pathways 
are those that interconnect components of functional cir- 
cuits, our analysis validates the intrinsic connection be- 
tween network structure and the functions of the ner- 
vous system. The concept that behavioral consequences 
of damaging a region of the brain will reflect the disrup- 
tion of the underlying network architecture may be in- 
tuitive but is particularly important when one considers 
brain dysfunction through neurodegeneration [72j , where 
atrophy is seen to propagate preferentially through net- 
works of functionally related neurons [73| . While the idea 
that damage in one part of the brain can affect other ar- 
eas connected to that region is not new, our work on 
the mesoscopic network organization may be extended 
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to look at disease progression through the network, be- 
yond the current focus on the pathology within individual 
cells. 



We obtain this using a generalization of the spectral 
method [7^, [76[. We first define a modularity matrix 
B, 



V. MATERIALS AND METHODS 

Connectivity Data: We have used information 
about the network connectivity, positions of neurons and 
the lineage distance between cells for the C. elegans ner- 
vous system, from the database published in Ref. [23| and 
available from the online Atlas of C. elegans anatomy 
(www.wormatlas.org). This is an updated and revised 
version of the wiring data originally published in Ref. ■ 
The connectivity between neurons and the positions of 
the neuron cell bodies along the longitudinal axis of the 
worm is reconstructed based on serial section electromi- 
crography. Note that, the new database adds or updates 
about 3000 connections to the previous version. The lin- 
eage data indicates the relatedness between every pair of 
neurons in terms of distances in the embryonic and post- 
embryonic lineage trees. This is measured by identifying 
the last common progenitor cells of the two neurons, and 
then counting the number of cell divisions from this com- 
mon ancestor. Each cell division adds a single unit to the 
total lineage distance, with the initial division from the 
common progenitor counted only once. 

Modularity (Q): To decompose a given network into 
modules (where a module or community is defined as a 
subnetwork having a higher density of connections rela- 
tive to the entire network) we use a method introduced 
in Ref. [zi| ■ We compute a quantitative measure of mod- 
ularity, Q, for a given partitioning of the network into 
several communities, 



Q = 



— V 

2L ^ 

i,3 



Ai 



2L 



(1) 



Here, A is the adjacency matrix (Ay is 1, if neurons i, 
j are connected, and 0, otherwise). The degree of each 
node i is given by ki = X^=i Aij. L is the total number 
of links in the network, S is the Kronecker delta (Sij = 1 , 
if i = j, and 0, otherwise), and Cj is the label of the 
community to which vertex i is assigned. In the case of 
directed and weighted network, the above measure can 
be generalized as 
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where, L 



w 



is the sum of weights of all links 
in the network (Wij is the weight of the link from neuron 
j to neuron i), and the weighted in-degree and out-degree 
of node i are given by sf 1 = J2j Wiy and s° ut 
respectively. 

The optimal partitioning of the network is the one 
which maximizes the modularity measure Q (or Q ). 
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(3) 



To split the network into modules, the eigenvectors cor- 
responding to the largest positive eigenvalue of the sym- 
metric matrix (B + B T ) is calculated and the commu- 
nities are assigned based on the sign of the elements of 
the eigenvector. This divides the network into two parts, 
which is refined further by exchanging the module mem- 
bership of each node in turn if it results in an increase in 
the modularity. The process is then repeated by splitting 
each of the two divisions into further subdivisions. This 
recursive bisection of the network is carried out until no 
further increase of Q is possible. 

Modular spectra: Wc analyze different neuronal 
groups, defined in terms of functions, anatomy (e.g., gan- 
glia), etc., by proposing a decomposition in terms of the 
overlap of their constituent neurons with the different 
modules. Let the set of all neurons be optimally parti- 
tioned into m modules. We then define an overlap ma- 
trix, O, where the rows correspond to the different neu- 
ronal groups, and the columns correspond to the different 
modules. An element of this overlap matrix, Oij, is the 
number of neurons in group i that are from the module 
j. Then, the decomposition of the i-th group in the ab- 
stract m-dimensional basis space formed by the modules 



Qm.,...,^}, where Ni = ^ =1 O lk is the to- 
tal number of neurons in the i-th group. The distance 
between two groups i and k in this "modular" space is 
defined as 



imodular 



\ 



E 



N, 



n 2 



(4) 



Thus, this measure can be used as a metric for closeness 
between different neuronal groups. 

Decomposition of the network into SCC, IN and 
OUT components: In order to determine the Strongly 
Connected Component (SCC) of the network, we first 
calculate the graph distance matrix containing the short- 
est directed path between every pair of nodes in the net- 
work. A finite path length from neuron i to neuron j indi- 
cates the existence of a connected path from one neuron 
to the other. By grouping together all nodes which have 
finite path length with all other members of the group, we 
determine the SCC. In general, one can use Tarjan's algo- 
rithm for SCC determination [j^]. Next, we identify all 
neurons not belonging to SCC but which can be reached 
via a directed path starting from a node in SCC. This 
constitutes the OUT component of the network. Simi- 
larly, the group of neurons which do not belong to SCC 
but which have finite directed path length to a member 
of SCC, constitutes the set of IN neurons. 

Betweenness centrality (BC): To measure the im- 
portance of a node in facilitating communication across 
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a network, we consider how frequently a node is used to 
convey information from any part of the network to any 
other part using the shortest available path. The be- 
tweenness centrality of a node i is defined as the fraction 
of shortest paths between the pairs of all other nodes in 
the network that pass through i [78[ . If the total number 
of shortest paths between nodes j and k is ajk , of which 
Tjk{i) paths pass through node i, then the betweenness 
centrality of node i is 



c B (i) = E 



CTjfc 



(5) 



Communication Efficiency (E): To measure the 
speed of information transfer over the network, one can 
define the efficiency Sij of communication between ver- 
tices i and j to be inversely proportional to the shortest 
graph distance £jj = l/dij\fi,j. Therefore, the efficiency 
of communication across the whole network is 



E(G) = r x = 



1 



\N(N 



(6) 



This is the harmonic mean of graph distances between 
all pairs, which does not diverge even when the network 
is disconnected (79j |. 

Network Randomization: An ensemble of random- 
ized versions of the empirical network is constructed 
keeping the in-degree and out-degree of each node un- 
changed. Each such network is created by rewiring ran- 
domly selected pairs of directed edges, i — > j and k — > I, 
such that, in the randomized network, the corresponding 
directed edges are i — > I and k — > j. However, if these 
new links already exist in the empirical network, this step 
is aborted and a new pair of edges are chosen in order to 
prevent the occurrence of multiple edges [36| . The above 
procedure is repeated 5 x 10 6 times for a single realiza- 
tion of the randomized network. In order to compare the 
properties of the empirical network with its randomized 
version, an ensemble of 10 3 realizations is considered. 

Determining the intra- and inter-modular role 
of a neuron: The role played by each neuron in terms of 
its connectivity within its own module and in the entire 
network is determined according to two properties flij : 
(i) the relative within module degree, z, and (ii) the par- 
ticipation coefficient, P. 

The z-score of the within module degree distinguishes 
nodes that are hubs of their communities from those that 
are non-hubs. It is defined as 



v/<(«£) 2 >ie Ci 
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(7) 



where is the number of links of node i to other nodes 
in its community c and (• • • )jec are taken over all nodes 
in module c. The within-community degree z-score mea- 
sures how well-connected node i is to other nodes in the 
community. 



The nodes are also distinguished based on their con- 
nectivity profile over the entire network, in particular, 
their connections to nodes in other communities. Two 
nodes with same within module degree z-score can play 
different roles, if one of them has significantly higher 
inter-modular connections compared to the other. This 
is measured by the participation coefficient Pi of node i, 
defined as 



P = 1 - > ^ 



(8) 



where K l c is the number of links from node i to other nodes 
in its community c and fcj = ^ c k\ is the total degree of 
node i. Therefore, the participation coefficient of a node 
is close to 1, if its links are uniformly distributed among 
all the communities, and is 0, if all its links are within 
its own community. 

Normalized mutual information (J): To measure 
the overlap of the membership of nodes in a ganglion with 
their membership in a specific module, we use the nor- 
malized mutual information measure [80l l8l|. First, we 
define a overlap matrix, O, where the rows correspond to 
the different ganglia, and the columns correspond to the 
modules obtained by optimal partitioning of the network. 
An element of this overlap matrix, Otj , is the number of 
neurons in the ganglion i that appear in the module j. 
An information-theoretic measure of similarity between 
the partitions can then be defined as 

E™1 Oi. io g (o,/o) + ET=i o, \o g {o,/o) ' w 

where ttia and tub are the numbers of ganglia and mod- 
ules respectively. The sum over row i of matrix Oij is 
denoted by Oi. and the sum over column j is denoted 
by O.j. If the modules are identical to the ganglia, then 
I takes its maximum value of 1. On the other hand, if 
the modular partitioning is independent of the division 
of the network into ganglia, I ~ 0. 
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VII. SUPPORTING INFORMATION 

Figure SI. Matrix representing the relatedness of 
neurons in the somatic nervous system of C. elegans as 
measured in terms of their lineage distance. The neu- 
rons are arranged according to the modules they belong 
to. The module boundaries are indicated in the figure. 
Within each module, neurons that are close in terms of 
lineage are placed in adjacent positions. The figure shows 
that closely related neurons may occur in different mod- 
ules, while those in the same module may be far apart in 
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terms of lineage distance. This indicates that there is no 
simple relation between relatedness of neurons in terms 
of lineage and their modular membership. 

Figure S2. Cumulative distributions of the strength 
and (inset) degree for the (a) gap-junctional, (b) synaptic 
and (c) combined networks. The gap-junctional network 
is undirected and the strength of a node is defined as 
Si = J2j Wij , where Wij is the number of gap junctions 
between neurons i and j. On the other hand, the synaptic 
and combined networks are directed and the inward- and 
outward-strength of a node are defined as s™ — J2j Wji , 
and s° ut — ^2 j Wij, respectively. For directed networks, 
Wij represents the number of connections from neuron j 
to i. The figures indicate that scale- free behavior of the 
distributions is seen only for the gap-junctional network. 
The other two networks exhibit exponentially decaying 
nature for both the degree and the strength distributions. 

Figure S3. The intra- and inter-modular connectivity 
of individual neurons in the C. elegans somatic nervous 
systems, color-coded to represent the different ganglia in 
which each occurs, (a) The within module degree z-score 



of each neuron in the empirical neuronal network is shown 
against the corresponding participation coefficient P. (b) 
The corresponding result for a randomized version of the 
C. elegans network where the degree of each neuron is 
kept unchanged. The lateral ganglion is seen to occupy 
a prominent position in the system, coordinating the in- 
formation flow between the neuronal groups responsible 
for receiving sensory stimuli and those controlling motor 
activity. On the other hand, the randomized network 
shows similar prominence for several other ganglia. 

Table SI. The classification of neurons according to 
their membership in the 6 modules obtained by optimal 
partitioning of the combined synaptic-gap junctional net- 
work of the C. elegans somatic nervous system. 

Table S2. The neuronal composition of different func- 
tional circuits in the C. elegans somatic nervous system. 

Table S3. The classification of neurons in the C. el- 
egans somatic nervous system, according to their role in 
intra- and inter-modular connectivity. 

Text SI. Analysis of the C. elegans network of synap- 
tic connections. 
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